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ABSTRACT 

We present a new, galaxy-halo model of large-scale structure, in which the galaxies 
entering a given sample are the fundamental objects. Haloes attach to galaxies, in 
contrast to the standard halo model, in which galaxies attach to haloes. The galaxy- 
halo model pertains mainly to the relationships between the power spectra of galaxies 
and mass, and their cross-power spectrum. With surprisingly little input, an intuition- 
aiding approximation to the galaxy-matter cross-correlation coefficient R(k) emerges, 
in terms of the halo mass dispersion. This approximation seems valid to mildly non- 
linear scales (fc < 3 hMpc~ ), allowing measurement of the bias and the matter 
power spectrum from measurements of the galaxy and galaxy-matter power spectra (or 
correlation functions). This is especially relevant given the recent advances in precision 
in measurements of the galaxy-matter correlation function from weak gravitational 
lensing. The galaxy-halo model also addresses the issue of interpreting the galaxy- 
matter correlation function as an average halo density profile, and provides a simple 
description of galaxy bias as a function of scale. 

Key words: large-scale structure of Universe - galaxies: haloes - cosmology: theory 
- dark matter - galaxies: formation - gravitational lensing. 



1 INTRODUCTION 

The halo model of large-scale structure has been quite suc- 
cessful in interpreting observations of galaxy and mass clus- 
tering. In the halo model, the dark matter in the Universe 
consists entirely of virialized clumps called haloes. The as- 
sumption is made that galaxies can only form within these 
haloes, with the number of galaxies per halo depending (pri- 
marily) on the mass of the halo, according to a halo occu- 
pation distribution (HOD). 

Some of the inputs to the halo model arise from pure 
theory, such as the linear power spectrum. Others come from 
theory with some calibration with simulations, such as the 
mass spectrum and bias of the haloes. Yet others are en- 
tirely empirical, such as the HOD, and halo density profiles. 
In fact, alth ough ideas related to the halo model have existed 
for de cades jNevman fc ScotJl952l : IScherrer fc Bertschingerl 
ll99lT) . it was not until a unive rsal halo density profile 
was discovered from simulations iNavarro. Frcnk & White 
Il99ft that the halo model b egan t o really develop 
iPeacock fc Smithl 1200(1 ISeliakl l2000t iMa fc Fry! 120001 
IScoccimarro et a lTl200ll: iBerlind fc Weinberd l2002ju For a 
review, see ICoorav fc Shethl J20021) . There are some things 
about the halo model which unsettle us somewhat; for in- 
stance, the cosmic web seen in observations and numerical 
simulations is not directly explained in the halo model as 



it currently stands. However, the halo model has matured 
to the degree that it see ms to be able to h elp constrain 
cosmological parameters dSeliak et al.ll2005allrl . As obser- 
vations and simulations improve, it is likely that the halo 
model will evolve to match observables from them arbitrar- 
ily well as theoretical ingredients are added to it, without 
fundamental conceptual changes. 

In this paper, we approach the problem of interpreting 
large-scale structure observations with a different philoso- 
phy, investigating how much can be learned with the obser- 
vations themselves, with as little theoretical input as possi- 
ble. The galaxy power spectrum P gg is taken as the funda- 
mental quantity, even though it is not known how to produce 
it theoretically at present. Haloes (or subhaloes; they are not 
differentiated in the model) are attached to galaxies, not the 
other way around. The galaxy-matter power spectrum P gm 
and the matter power spectrum P mm (or, more accurately, 
their two-halo terms) are then simple convolutions of P gg 
with average halo density profiles. Some added information 
about large-scale bias and the halo mass dispersion yields a 
surprising amount of information about the galaxy-matter 
cross-correlation coefficient, the bias, and the cross-bias. 

We emphasize that this galaxy-halo model is not meant 
to be a competitor to the standard halo model, which could 
be called the 'largest virialized' halo model, since each halo 
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in the standard halo model is the largest possible virialized 
structure. The galaxy-halo model as presented in this paper 
is not a formalism which may be used to compare to all 
conceivable observations, although it may evolve to be more 
encompassing in the future. It is currently restricted because 
it has few ingredients, and it is not known how to obtain 
the main ingredient P gg a priori. In fact, we are surprised 
that because it is simpler, the galaxy-halo model was not 
developed before the halo model. The above criticism of the 
halo model (that it does not produce the cosmic web) applies 
to the galaxy-halo model, too. However, the simplicity of the 
galaxy-halo model has benefits; it allows for some intuitive 
insights, and for interpretation of some observations with 
few assumptions. 



2 RATIOS OF CLUSTERING STATISTICS 

It can be useful to form ratios of two-point statistics 
(the power spectr u m and the cor relation function; see 
e.g. IPeeblesl 1198(1 iHamiltonl 1200.4) which measure the 
clustering of matter and galaxies. In Fourier space, the 
(squared) bias is b 2 (k) = P gg (k)/ P mm (k), the ratio of 
the galaxy power spectrum to the matter power spectrum. 
The (squared) galaxy-matter cross-correlation coefficient is 
R 2 (k) = P gm (k) 2 /[P gg {k)Pmm{k)], where P gm is the galaxy- 
matter cross power spectrum. With the advent of galaxy- 
matter observations from weak lensing, the 'cross-bias' has 
also been used: b/R = P gg (k)/ P gm (k). 

The same ratios may also be defined in real space, using 
correlation f unctions instead of p ower spectra; recent obser- 
vations (e.g. ISheldon et al.ll2004t) have tended to favor the 
real-space description. There are at least two good reasons 
for this: £ gm , and not P gm , is more directly measured from 
weak-lensing observations; and also, £ is easier to understand 
intuitively and visually. 

However, the theoretically preferred representation is 
in Fourier space, for several reasons. Most relevantly for the 
present paper, the Schwarz inequality imposes a mathemati- 
cal constraint on the cross-correlation coefficient R(k) when 
expressed in Fourier space. The Schwarz inequality requires 

that (|5 m (k)<5 9 (-k)|) 2 < (l<5 9 (k)| 2 )(|Mk)| 2 ). V cmd only if 
the galaxy and matter power spectra include the shot noise, 
this gives 



R 2 (k) 



Pgm{k) 



Pgg (^)fmm (&) 



< l. 



(i) 



The shot noiSG 111 Pmm is negligible because dark-matter 
particles are practically infinitesimal on all astrophysically 
relevant scales. However, the shot noise in P gg may be com- 
parable to the galaxy clustering signal, and so including or 
excluding the shot noise in P gg makes a significant differ- 
ence. 

The galaxy-matter cross-correlation coe ffi cient 
R h a s been discussed jDekel_^^ahavJ_^22|; IPerJ 
Il998l: iTeemark fc Bromlevl ll99Sl iTaruva fc Sodal Il999t 

eliak & Warrc nl l2004h as a measure of the stochasticity, or 
scatter, in the relationship between the galaxy overdensity 
8 g and the matter overdensity 5 m . This interpretation of 
R is clearest when the Fourier-space representation of R 
is used, and when the shot noise is included in P gg ; thus, 
we advocate defining R in this manner. Then R(k) has a 



straightforward physical meaning: it is unity on large scales 
where 5 g (k) and 8 m {k) are simply related, and decreases 
on small, nonlinear scales where significant scatter exists in 
the relationship. 



3 THE STANDARD HALO MODEL 

Now we w ill outline the standard halo model, d rawin g pri- 
marily on ICoorav fc Shethl ll2002Tl and ISeliaJd foOOtf) . For 

the matter power spectrum P mm (fe), the following ingredi- 
ents are necessary: the linear power spectrum Pii n (fc), the 
number density of haloes as a function of mass n(M)dM, 
the large-scale bias as a function of halo mass, b(M), and 
the average Fourier-transformed density profile of a halo of 
mass M, y(k,M). This density profile is normalized to be 
unity at k = 0: 



2/(k, M) = — 



p(r, M) exp(— ik • r)d r, 



(2) 



where p(r, M) is the average real-space density profile of a 
halo of mass M. For a spherically-symmetric density profile 
p(r, M) , this becomes 



y(k,M) 



47T 

M 



sin kr 
kr 



p(r, M)r dr. 



(3) 



The power spectrum is a sum of one-halo (lh) and two-halo 
(2h) terms: 



Pmm (^v 



where 



Pmm (^) 



Pmm(k) + _P mm (/c), 



y(k,M) 2 ^ldM, 



and 



Pmm(k) — Plin{k) 



b(M)y(k,M)-± — '- — dM 



(4) 



(5) 



(6) 



Here, p is the average matter density. 

With galaxies come a few more ingredients: a halo occu- 
pation distribution (AT ga i|M) giving the distribution of the 
number of galaxies inside a halo of mass M, giving a total 
galaxy number density ra ga i, and a quantity y ga i(fc), which 
describes the average galaxy density profile of a halo, in 
general different from its matter density profile. The galaxy- 
matter power spectrum P gm and the galaxy power spectrum 



P g m(k) 



y(k,M)y gaX (k,M) p - 1 
(JVgailM) n(M)M 



rigal p 

Pun(k) I Vfn(k,M) 



dM; 
(N s ,i\M) 



b{M)n{M)dM 



y(k, M)b(M) ^—^-dM; 



y(k,M)y sal (k,M) p 
(iV gal (jV ga i-l)|M) 



n(M)MdM; 



(7) 



(8) 



(9) 



'gal 
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2/gal(fe,M) 



(^gal|M) 



b{M)n{M)dM 



(10) 



Here, p is 2 if there is more than one galaxy per halo, and 
1 otherwise; this arises from an assumption that there is a 
galaxy at the centre of each halo of sufficient mass. (See 
ICoorav fc Sheth1l2002l for more explanation.) 



4 THE GALAXY-HALO MODEL 

A 'galaxy halo' is defined in this paper to be a dark-matter 
halo around a galaxy. As in the standard halo model, all 
power spectra in the galaxy-halo model are sums of one-halo 
(lh) and two-halo (2h) terms. These could be called one- 
galaxy and two-galaxy terms, since the fundamental objects 
in the galaxy-halo model are galaxies. However, galaxies in 
the galaxy-halo model are pointlike objects. It is in their 
haloes that matter is found, and where the galaxy-matter 
and matter power spectra measure the clustering. 

Even though the lh and 2h terms in the galaxy-halo 
model share labels with their counterparts in the standard 
halo model, the terms in the two models differ conceptually, 
and generally differ numerically as well. That is, the lh and 
2h terms get different shares of the total power spectra in 
the galaxy-halo model than in the standard halo model. 

Since the galaxy power spectrum P gg is one of the in- 
puts of the galaxy-halo model, the expressions for it are 
extremely simple. The two-halo term Pgg is just the galaxy 
power spectrum without shot noise. To emphasize that it is 
a fundamental input into the model, we simply denote Pgg 
as P. There is also a one-halo term in the full P gg , which 
is the shot noise 1/n, where n is the total number density 
of galaxies. So, the two terms in P gg , with the shot noise 
included, are 



P 3g (k) = P%(k) + P£(k) = P(k) + 1/n. 



(11) 



Dark-matter halo density profiles affect the other power 
spectra, P gm and Pmm- First we discuss what they are in 
a simple, single-mass model, and then in a more realistic 
model with a distribution of halo masses. 



4.1 Single-mass model 

In this section appears a highly simplified model, some of 
the results of which carry over to the next, more realistic 
model. Consider a population of galaxies of number density 
n and power spectrum P(k), and with haloes of identical 
masses and identical, spherically symmetric density profiles. 
The density profile p(r) Fourier-transforms into y(k), in the 
same manner as in eq. The density profile must be well- 
behaved in the sense that as r — > oo, p(r) — > in such a way 
that y(k) -> 1. 

Both the lh and 2h terms of the galaxy-matter power 
spectrum depend on the Fourier-transformed density profile 
y(k). The pairs comprising the lh term are galaxies with 
dark-matter particles in their own haloes: 

P gm (k) = n- 1 y(fc). (12) 



The 2h term is the power spectrum of galaxies with dark- 
matter particles in other haloes; it is a product in Fourier 



space of the galaxy power spectrum P(k) with the average 
halo profile y(k). 

Pgl{k) = P(k)y(k). (13) 

The matter power spectrum is similarly defined: 

P m h m {k) = n-'yikf, (14) 

Pm m (k) = P(k)y(k) 2 . (15) 

This simple model yields simple formulae for the bias b(k) 
and the cross-correlation coefficient R(k). The (squared) 
bias, excluding the shot noise from P gg , is 



b- S n(fc) 



P(k) 



Pl h m {k) + P^ h m (k) P(k)+n 



PW T y{k)-\ (16) 



If the galaxy power spectrum includes the shot noise, the 
bias is simply 



b 2 (k) =y(kY 



(17) 



The equation for R 2 (k) (where P gg includes the shot noise) 
is even simpler: 



R 2 (k) = 



ffi(*) + ^(*)f 



[p(k) + n-^\[p^ m {k) + p&m 



= i. 



(18) 



This makes sense: galaxies and matter are perfectly cross- 
correlated if all galaxy haloes are identical. 



4.2 Multiple-mass model 

Now, more realistically, suppose that there is a distribution 
of halo masses and shapes (which need not be spherically 
symmetric). As a pedagogical aid to those familiar with the 
halo model, we will point out how various terms change or 
disappear using the galaxy-halo model. In doing this, we do 
not mean to imply that the galaxy-halo model is a subset of 
the halo model, in which additional assumptions are made. 
The fundamental assumptions of the two models differ. 

In the halo model, galaxies are put into haloes, while in 
the galaxy-halo model, haloes are put around galaxies. So, in 
the galaxy-halo model, the HOD (which does not explicitly 
appear) is identically 1, and y ga i is unnecessary. The galaxy 
power spectrum P is the fundamental quantity, so Pn n does 
not appear. The bias as a function of M, b(M), also is not 
needed, since it is subsumed into P. However, there is still a 
large-scale bias bo multiplying the three 2h terms; as k — > 0, 
P^ik) = b P 2 gm {k) = b 2 Pl h m (k)- 

Let the total number density of galaxies be denoted 
n — J n(M)dM, and the total mass density be denoted 
p = J n(M)MdM. Here, n(M)dM is the number density 
of galaxy haloes of a given mass; henceforth, n denotes the 
total galaxy number density unless it explicitly appears as a 
function of mass. Not surprisingly, the expressions for P gm 
and Pmm become more complicated when there is a distri- 
bution of masses. It is still straightforward, though, to ex- 
press the lh terms. The average Fourier-transformed density 
profile as a function of mass y(k,M) is the average of y(k) 
over haloes of mass M, and the mean-square y 2 (k,M) is 
the average of y(k) 2 . Assuming a large enough volume that 
departures from isotropy vanish, these average halo profiles 
must be spherically symmetric, and thus have zero imagi- 
nary components. The density profiles comprising P gm are 
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Figure 1. For two different halo populations, comparisons of 
the average Fourier-transformed density profiles y gm , !/mmi 
and , as measured from J V-bod y simulations described by 

iNevrinck. Hamilton fc Gnedinl fc004l The mass threshold defin- 
ing the top panel's halo sample is 3 X 10 13 h~ 1 Mpc; the bottom 
panel's threshold is smaller by a factor of 512. The lh profiles 
were obtained from P gm and P^ m (measured by 
Fourier-transforming ^ m and §i?m) using eqs. fL9l and pi). The 
lh profiles yV^ and were measured from P~* , P~t„ , and 



2/gm and 



P using eqs. 1241 and 1251 . These latter two equations require an 
estimate of the large-scale bias bo for each halo sample. For the 
top panel, we fitted bo = 1 by hand, and for the bottom panel, 
we used the value bo = 0.75 used for Fig.|H] which is a plot of the 
bias from the same halo sample. 



averaged weighting by the product of the number density of 
galaxies and the mass density: 



Pgrn(k) = — / y(k,M)n(M)MdM = n^y^k), 
np J 

where y g „ is a mass-weighted average halo profile 



V g m(k) 



J y{k,M)n(M)MdM 
J n(M)MdM ' 



(19) 



(20) 



The density profiles comprising P}^ m are averaged weighting 
by the mass squared: 



Pmm\k) 



, y 2 (k,M)n(M)M 2 dM=^y 1 J: m (k) 2 , (21) 



bias factor of a set of haloes v aries with their ma ss. This 
behaviour can be modeled (e.g. lMo fc WhiteHl996Tl . but to 
our knowledge, such a model has not been formulated which 
counts subhaloes as haloes, as in the galaxy-halo model. 
Even if there were such a model, adding it to the galaxy-halo 
model would cause a significant increase in the galaxy-halo 
model's complexity, which we wish to avoid. 

However, one thing may be safely assumed about the 
2h terms: on large-enough scales (where the lh terms are 
negligible, and all average Fourier-transformed density pro- 
files are unity), the cross-correlation coefficient R(k) = 1. 
To see this, suppose the universe consists of regions large 
enoug h so that the galaxy bias in each region is entirely lo- 
cal dColeslll993l) and is statistically independent of the bias 
in other regions. In the limit k — » 0, a measurement of R(k) 
requires averaging over a number of regions which goes to 
infinity, squeezing the variance of the biases in different re- 
gions to zero. This gives R(k — > 0) = 1. Assuming as much, 
the following equations hold: 



P 2 m {k)=P{k) 



ygm(k) 



v 2h 



(AO 



bo 



(24) 
(25) 



Here, bo = lim^o \J P(k)/ P^tin(k) is a large-scale bias (usu- 
ally of order unity) and y gm (k) and ymm{k) are effective av- 
erage halo density profiles, defined to be unity as k — > 0. 
These average halo density profiles have no imaginary com- 
ponent, by construction, since they are defined in eqs. 1241 
and 125H in terms of other real quantities. 

In general, y lh differs from y 2h (for both galaxy-matter 
and matter power spectra) because halo density profiles 
change systematically with the clustering strength of the 
haloes, and y gm differs from y m m (for both lh and 2h terms) 
because the y mm is a root-mean-square average, whereas 
y gm is a straight average. Even though these terms typi- 
cally differ, below we will explore what emerges under the 
approximations that y lh = y 2h and y gm = y mm . 

Figure compares these four average 



Fourier- 



transformed density profiles y gr ^, mm , for galaxies placed at 
the centres of two different sets of haloes from iV-body sim- 
ulations, characterized by large (top) and small (bottom) 
mass thresholds. For each panel, the profiles do not corre- 
spond exactly, but they are similar. The agreement is better 
using a large mass threshold because excluding small haloes 
narrows the distributions of halo masses and density profiles. 

Assuming that y^ = y 2 g t = y gm , and y^ m = y^ m = 
ymm, the expressions for P gm and P mm simplify: 



P gm (k) » [P(k)/bo + l/n]y g m(k); 
P mm {k) « [P{k)/bl + p/n]y m m{k) 2 . 



(26) 
(27) 



where 



lh 

Vmm 



1 J y 2 (k,M)n(M)M 2 dM 



(22) 



' / n(M)M 2 dM 

and jj, is a dimensionless mean-square halo mass, 

At = (M 2 )/{M) 2 > 1. (23) 

Not as much can be said about the 2h terms with the 
assumptions made so far, primarily because the large-scale 



With these power spectra in hand, it is possible to cal- 
culate ratios between them: the bias b(k), the cross- bias 
b/R = P gg /P gm , and the cross-correlation coefficient R(k). 



Here are equations for the bias (squared) b 2 
excluding the shot noise from P gg : 

2 d 
fe_ sn (fe) = 



Pgg/P" 



P(y^ m /bo) 2 + {p/n)(y^m) 2 



P/b 2 +l i/n 



(28) 
(29) 
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Equation 1291 follows if y^ m = y 
included in P gg , then eq. 1291 becomes 



If the shot noise is 



b 2 {k) 



P + n~ 



P/bl + ft/n- 



(30) 



Equation 1291 contains the general features in the bias 
(excluding the shot noise from P gg ) which have been ob- 
served from simulations and from the halo model (Seljak 
2000). On large scales, the halo profiles in real space go to 
zero [and thus y mm (k) — > 1], and the galaxy-matter two-halo 
term overwhelms the one-halo term [P(k)/bo ^> fx/n]. Thus, 
as expected, the only signal on large scales is the large-scale 
bias, and 6_ sn (fc) = bo. On intermediate scales, where the 
two-halo and one-halo terms in the matter power spectrum 
are comparable [P(k)/b 2 ~ p/n], assuming that y mm (k) ~ 1 
still holds, the bias decreases. On small scales, the one-halo 
term dominates \P(k)/b% <C fl/n], and so the behaviour of 
the bias depends on whether P(k) or y m m{k) 2 decreases 
faster as k — > oo. Generally, j/ mm (fc) 2 decreases faster, forc- 
ing the bias upward. If P gg includes the shot noise, then the 
bias always increases with k at small scales. 



The cross-bias b/R, keeping the shot noise in P gg , is 



b_ 

R 



P + n' 



Pyl^/bo + n-^yW 



P + n 



P/b + n 



_l Vgm 



(31) 
(32) 



Vgm 



Equation follows if y\ m 

A formula with few inputs also emerges for the cross- 
correlation coefficient R 2 = P gm / 1 (P gg Pmm)- Including the 
shot noise in P a „, 



(Py gm /bo + n y gm ) 



R 2 {k) = 



(P + n^)[P{y* 

(P/bp + n- 1 ) 2 
(P + n^)(P/b 2 + p/n) 

(P/bp + n- 1 ) 2 
(P + n~i)(P/b 2 + n/n) 



/b ) 2 + Wn)(yi 

2 

Vgn 



y-inm 



(33) 



(34) 



(35) 



Equation 1341 is true if ymm = y^m (and similarly for y gm ), 
and for eq. 1351 . a more extreme assumption is made, that 
y gm = ymm- Since y gm is weighted by mass, and y mm by 
mass squared, the latter assumption will only be valid if halo 
profiles do not depend on mass, which is almost certainly not 
the case. Excluding the shot noise in P gg turns (P + n" 1 ) 
into P in the denominator in eqs. (13411 and 1351 . 

In eq. (1351 . on large scales, P dominates both n -1 and 
(p/n), so R 2 approaches unity. On small scales, P is small, 
and so R 2 approaches 1/fi. More specifically, under these 
approximations, R 2 is an interpolation between 1 (its 2h 
value) and 1/p (its lh value), weighted by the respective 
matter terms: 



R 2 (k) : 



Ug n 



(P + n- 1 )(P, 
° 2h + <l/u)P lh 

mm i \ / Pv rni 



+ P 1 

\ 1 rr. 



(36) 
(37) 



where we assume that y x g % 
terms). Here, e(6o) is small if bo 



It m in 



(and similarly for the 2h 
1: 



-(bo) = 



{bo - I) 2 



(38) 



l + nP + fibl[l + l/{nP)Y 

Although eq. 1351 is exactly true in the more general 
case of zero variance in halo density profile shape, it may 
be helpful to interpret eq. 1351 by visualizing the haloes 
as a collection of nuggets (instead of extended haloes) of 
varying mass. Consider the smallest scales, k > 7r/r m i n , 
where r m j n is the smallest intergalactic distance. Here, the 
Fourier-transformed galaxy and matter overdensities 5 g (k) 
and 6m(k) sample at most one galaxy, and so R 2 (k) sees only 
the mass dispersion of galaxies. This is where the galaxy 
power spectrum P <C 1/n, and thus R 2 — > in eq. 1351 . 
At progressively larger scales, R 2 samples more and more 
galaxies, until at the largest scales, so many galaxies en- 
ter the average that the scatter in the 5 g -to-S m relationship 
vanishes, pushing R 2 — > 1. 



4.3 Orphans 

Up to now, we have only defined a galaxy halo as a clump of 
dark matter (surrounding a galaxy) whose density falls off to 
zero at large radius. In practice, one way to define a galaxy 
halo sample could be to populate a set of bound dark-matter 
haloes and subhaloes (from a simulation, for example) with 
galaxies. This set of galaxy haloes could be characterized by 
a bound-mass or circular-velocity threshold, for example. 
But what about orphaned matter particles which are not 
bound to any galaxy halo? Such matter exists not only in 
voids, but in small, isolated haloes which do not meet the 
criteria to host a galaxy and are not bound to any larger 
galaxy haloes. 

One way to deal with orphans is to adopt them into 
galaxy haloes, removing the condition that galaxy haloes 
must be gravitationally bound. However, it is not clear how 
to partition the unbound matter into galaxies, and alter- 
ing the partition could significantly affect the dimensionless 
mean-square halo mass /x. 

In this paper, we exclude orphans from galaxy haloes, 
but they cannot be completely ignored. Even under the as- 
sumption that orphans do not affect clustering properties, 
excluding them in the calculation of the matter and galaxy- 
matter power spectra results in the wrong normalization. If 
Pgm and Pmra are calculated using only non-orphans, then 
the multiplicative 'orphan factor' ph/p must be applied once 
to P gm and twice to P mm to get the normalization right. 
Here, ph is the density of matter in galaxy haloes, and p 
is the total matter density. 'Orphan factors' must be used 
with the bias or cross-bias, but they cancel out for the cross- 
correlation coefficient R. 

Even if orphans are excluded, the question of partition- 
ing the matter into galaxy haloes can be ambiguous. Power 
spectra care only about density contrasts, not about whether 
matter is bound to galaxies, so boundedness is not necessar- 
ily the right test to determine galaxy halo membership. 

In the context of the galaxy-halo model, the best parti- 
tion of matter is the one which makes the predictions of the 
galaxy-halo model work best. There are a few ways of judg- 
ing matter partitions using this criterion. One of them is to 
make the approximations y^ m = y 2 m and y^ m = y^ m hold 
as closely as possible, but this is hard to test over a wide 
range of halo samples. The easiest meaningful quantity to 
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measure from a partition is the dimensionless mean-square 
mass fi. Measuring R 2 (k), which is independent of the par- 
tition, and looking at its typical small-scale value, gives an 
idea of the 'natural' dimensionless mean-square halo mass. 
However, as displayed below in Figure |SJ if haloes extend 
out to a scale where P gg is significant, R 2 may stay above 
its characteristic small-scale value, making /i hard to deter- 
mine from R 2 . There is some ambiguity in how precisely the 
matter should be partitioned, but that is not necessarily a 
bad thing, since the observed power spectra cannot depend 
on the partition. 



5 TESTS 

Can the galaxy-halo model be used to extract meaningful 
information from observations? The galaxy-halo model con- 
tains three items which are potentially useful: simple de- 
scriptions of the galaxy-matter bias and cross-correlation 
coefficient; and the capacity to separate out lh and 2h terms 
from an observed galaxy-matter power spectrum. This sec- 
tion describes tests of these items, and also explores how 
predictions of the galaxy-halo model vary with properties of 
a galaxy-halo population. 

5.1 Mock halo catalogs 

5.1.1 Isolating f 9 JJ, 

The galaxy-matter correlation function £ gm is sometimes in- 
terpreted as a measure of the average overdensity profile 8{r) 
of haloes around galaxies. However, the one-halo may 
be a more appropriate measure of the average overdensity 
profile of galaxy haloes, since including (jjj, would double- 
count matter in overlapping regions. The lh and 2h terms 
of £ 9 m are not observable by themselves; only their sum is. 
The galaxy-halo model for P gm allows removal of an effec- 
tive two-halo contribution to the galaxy-matter correlation 
function £, gm , if the galaxy correlation function £ 99 is known 
as well. 

In this section, we discuss a test of how well the average 
overdensity profile S(r) can be measured from £ 99 and £ 9m 
within the framework of the galaxy-halo model. For galaxy 
positions, we used the centres of dark-matter haloes from 
a 256 h' 1 Mpc, 256 3 -particle ACDM dark-matter-on ly N- 
body simulation iNevrinck. Hamilton fc Gnedinl2004l here- 
after NHG). To detect th e haloes, we used the halo- 
findin g algorithm VOBOZ iNevrinck. Gnedin fc Hamilton! 
2005), with a density threshold of 100 times the mean den- 
sity. All haloes exceeded a 2a VOBOZ probability threshold. 
The closest pair of galaxies was separated by 0.7 h~ x Mpc. 

Around the galaxies, we p ut identically shaped NFW 
iNavarro. Frenk fc White! Il997l) profiles with scale radii of 
1 h^ 1 Mpc, all truncated at a deliberately large radius of 
8 h" 1 Mpc. At this truncation radius, many haloes over- 
lapped, providing a sizeable 2h term to subtract off from 
(, gm - Although all the haloes had identical shapes, we pre- 
served the number of particles in each halo from the sim- 
ulation by varying the density profiles in the mock catalog 
by multiplicative constants. The haloes ranged in particle 
number from 821 to 10222 particles, with a dimensionless 
mean-square mass fi = 1.39, for a total of 917501 particles. 
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Figure 2. An NFW density profile, with a scale radius of 
1 h^ 1 Mpc and truncated at 8 Mpc, along with various at- 
tempts to recover it from a mock catalog with haloes with the 
profile. In the top panel, the solid black curve shows the total 
1 + £gm, and the solid grey (online, green) curve shows the one- 
halo l + The dashed grey (online, magenta) curve shows our 
attempt to isolate the lh term, assuming y = 1 in the smallest- 
wavenumber bin, and the dotted grey curve shows the same as- 
suming y = 0.8 in the smallest-wavenumber bin. The bottom 
panel shows the ratio of the respective curves in the top panel with 
the true NFW profile. The curves disappear for r > 8 h~ l Mpc 
(actually, 6 h~ 1 Mpc, the centre of the bin which goes up to 
8 h^ 1 Mpc) because the input profile is truncated there. 



We used the following procedure to separate the lh and 
2h terms of £ o m . First, Fourier -transform £ 9m and £ 99 [e.g. 
using FFTLog llHamiltonll2000l) l . and then solve for y gm in 
eq. 1261 . To obtain £ 9 „, Fourier-transform P g ^ = ylm/n 
back into real space. Doing this requires an estimate of 
the large-scale bias bo- If the size of the haloes is small 
compared to the volume of the sample, it is safe to as- 
sume that y gm (kmin) is of order unity, where fc m i n is the 
smallest wavenumber in the power spectrum. An upper 
limit, and quick estimate, of feo comes from assuming that 
y g m{kmin) = 1. If both £ 99 and £ 9m are measured well out to 
linear scales, 60 may be measured directly from their ratio. 

In Fig.|5J the full l+£ 9m overestimates the NFW profile 
by almost a factor of 2 at the largest scales, whereas l + £ 9 m 
reproduces it much better. The full and lh £ 9 m's start to 
diverge at about r = 0.7 h~ l Mpc, which is the separation 
of the closest pair of galaxies, where 1 + £ 9 Jjj starts to be 
positive. Our initial try of y 9m (femin) = 1 (giving bo = 0.97) 
subtracted off most of £ 9 ™, but using y 9m (fc m i n ) = 0.8 (giv- 
ing 60 = 0.72) resulted in a better fit. Thus, even though the 
halo diameters were only 1/16 of the box size I, evidently 
J/ 9 m(fc m m = 27r/Z) did not quite reach unity. 
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Figure 3. Testing eqs. 1341 and 1351 for R 2 (k). Density pro- 
files p(r) ~ r~ 2 were put around galaxies such that the cen- 
tral densities were held fixed, but the haloes grew in radius, 
proportional to their masses. The dotted line is at 1/fi = 0.5, 
and the dashed curve is approximation 1351 . The solid curves 
depict R 2 for different halo radii per particle. The halo radius 
per particle varies by factors of \/l0 from 10~ 5,5 (rightmost 
upturn) to 10~ 25 h~ 1 Mpc (leftmost upturn). For the set of 
haloes constructed with 10 — 2,5 h _1 Mpc per particle, the min- 
imum and maximum halo radii were 0.36 and 16 h" 1 Mpc. The 
dot-dashed curves show the approximations given by eqn. 1341 . 
with (ygm/ymm) 2 calculated analytically. 



5.1.2 Behaviour of the cross-correlation coefficient 

This section describes a test of the galaxy-halo model ap- 
proximations for the squared cross-correlation coefficient R 2 , 
eqs. 1341 and 1351 1. We wanted to gauge the accuracy of the 
approximation, as well as investigate how halo profiles affect 
R 2 in general. For this test, the galaxy-halo profiles for all 
masses had a fixed density at each radius, but the trunca- 
tion radius varied with mass. We used a convenient density 
profile, p(r) ~ r~ 2 , for which the truncation radius is pro- 
portional to mass. 

For this test, the galaxy positions were those of the 
centres of the 4132 largest VOBOZ haloes exceeding 2a in the 
256 ft -1 Mpc simulation described above. The haloes ranged 
in mass from 230 to 10222, enough to give a dimensionless 
mean-square mass p — 2. The smallest galaxy separation 
was 0.113 h- 1 Mpc. 

Figure|H]shows the results of varying the halo radius per 
particle from 10" 5 ' 5 to 10" 2 ' 5 h' 1 Mpc. The power spectra 
Pgg, Pgm and P mm were calculated from 3D FFTs of the 
galaxy and matter distributions, reaching small scales by 
'folding' the particle distribution by factors of two (Klypin, 
private communication). For each fold, the boxes were split 
into eight octants, and each octant was superposed together 
in a box of half the size; thus, each fold enabled the FFT 
to reach scales smaller by a factor of two. At large scales, 
R 2 ~ 1 because with a large window function, many galaxy 
haloes are averaged over. The R 2 curves then descend with 
k as eq. 13511 predicts, but then turn up at about the scale 
(ir/r) of the largest halo radius, and finish ascending at 
about the scale of the smallest halo radius. It makes sense 
that R 2 « 1 at small scales where the haloes are identical. 



Knowing everything about every halo makes it possible to 
calculate y g m/ymm analytically; putting this into eq. 1341 
brought the approximation quite close to the measured R 2 . 
For some reason, the measured R 2 curves did not quite reach 
unity, as the analytical curves would predict. Downturns, 
only visible here for the two largest halo radii per particle, 
occur at about the scale of the tightest matter pair. Such 
downturns would not occur in the real Universe, which has 
much higher 'resolution.' 

5.2 Simulations 

The following tests involve more realistic density fields, 
drawn from simulations. The tests evaluate the galaxy-halo 
model's descriptions of the cross-correlation coefficient and 
the bias between galaxies and matter. 

5.2.1 The cross-correlation coefficient approximation 

In this section, we discuss a comparison of the predictions of 
approximation 1351 for the squared cross-correlation coeffi- 
cient R 2 (k) with measurements from simulations. The test 
also investigates the degree to which it matters if orphans 
(particles not gravitationally bound to haloes) are included 
in the calculation. The galaxy positions were drawn from the 
centres of vOBOZ-identified haloes in a suite of four ACDM 
simulations described by NHG. 

Figure 2] shows R 2 for various halo catalogs, using dif- 
ferent box sizes (32, 64, 128, and 256 h' 1 Mpc), and dif- 
ferent lower mass thresholds. An additional threshold of 2a 
in VOBOZ halo probability eliminated many spurious haloes. 
Measurements of R 2 , both including and excluding orphans 
(see sect. 14. 3L are shown. Orphans make a significant differ- 
ence in R 2 with a high halo mass threshold, but not other- 
wise. Orphans would likely make a greater difference if there 
were an upper mass threshold as well, since most of the pairs 
comprising P gm and P mm lie in the largest haloes. 

Figure [I] also shows the approximation of eq. 1351 . For 
the dashed curves, we measured the dimensionless mean- 
square halo mass fi using the VOBOZ particle halo member- 
ship. Some particles belong to more than one halo in VOBOZ; 
we removed this ambiguity by assigning each particle to the 
smallest-mass VOBOZ halo containing it. 

In Fig. |1] especially for low mass thresholds, the R 2 
curves do not reach their characteristic small-scale value as 
predicted from the VOBOZ fi. The dashed curves use a re- 
duced n' , in which the masses of haloes in clusters are equal- 
ized in an extreme way. (We define a cluster to be a set of 
haloes such that each halo is within the half-mass radius of 
another halo in the cluster.) For // , the mass of each halo 
in a cluster is set to the mass of the parent halo (the largest 
halo in the cluster) divided by the number of haloes in the 
cluster. This is not an unreasonable thing to do since, from 
the point of view of R 2 , it may not be appropriate to dis- 
tinguish between large parent haloes and small subhaloes. 
In a cluster environment, all R 2 sees is a group of galaxies 
surrounded by a bunch of matter; it does not know whether 
the matter is bound to parent haloes, to subhaloes, or to 
neither. 

Figure|S|shows fi and p! as a function of lower halo mass 
cut-offs. As expected, the difference between them grows 
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Figure 4. Testing approximation 1351 for the galaxy-matter cross-correlation coefficient R 2 (k) in actual simulations, for different halo 
mass cut-offs and box sizes. From left to right, the halo mass (given in particle number) cut-off decreases by a factor of eight, the same 
factor by which the particle mass decreases from top to bottom with box size. Thus, the physical halo mass is the same along diagonals 
starting from the bottom-left and going up and right. The squares show R 2 calculated using only particles in haloes as identified by 
VOBOZ, while the grey (online, green) diamonds show R 2 using all particles in the simulation. The dashed (online, red) lines show 
approximation 1351 . with the dimcnsionless mean-square halo mass fi calculated from the VOBOZ halo masses. For the dotted (online, 
blue) lines, a smaller fi' was used, calculated by identifying clusters of haloes (i.e. a halo and its subhaloes) such that each subhalo is 
within the half-mass radius of a parent halo, and then distributing the mass of the parent halo equally among the haloes and subhaloes. 
We fixed the large-scale bias bo by requiring that all of the curves line up in the largest-scale, lowest-wavenumber bin. 



with the amount of substructure, i.e. as the halo mass cut- 
off is decreased. Figure [S] also shows /i mea s as measured from 
the curves in Fig. 0] /i mca s = where -R^in ls the 

lowest value R 2 (k) attains for k < 10 /iMpc -1 . A simple fit 
to /imeaa from our simulations, shown as the dotted curve in 
Fig. is 

( M ■ V°' 4 
/Ws^l+^ 1013 - 1M J , (39) 

where M m i n is the lower halo mass cut-off of the sample. 



The modified (in an extreme fashion) [s! does agree with 
Mmcas better than the original fi, but there is still a signif- 
icant difference, which can be explained with reference to 
Fig. [3] Galaxy-halo profiles encroach into the regime where 
P gg is significant, causing an upturn before (i.e. at larger 



scales than) R 2 would have attained its smallest value. An- 
other way to look at the generic rise of R 2 at small scales is 
that, again referring to Fig. the profiles of galaxy haloes 
of different masses are more similar to each other at small 
scales than at intermediate scales. 



F igure al so sh ows an estimate of fi from a 
Sheth fc Tormenl II1999I) mass function with a lower, but not 
upper, mass limit. This estimate agrees with /i mca s about as 
well as the voboz fi estimate, ex cept at high masses. We 
also tried estimating from the ISheth fc Tormenl (Il999l) 
mass function using an upper mass limit given by the largest 
halo mass appearing in each simulation. These results are 
not shown; this procedure estimated (j meas well for high- 
mass (and therefore low-substructure) halo samples in the 
256 h~ x Mpc simulation, but somewhat poorly for lower- 
mass haloes in other simulations. 
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Figure 5. An illustration of different estimates of the dimen- 
sionless mean-square halo mass fi from four ACDM TV-body sim- 
ulations. The abcissa is the lower mass threshold used to define 
each halo sample. The curves are averages of the results from 
the four nested simulations. The positions of the squares show 
/i mcas = l/-RjL: n , where Ri,i„ is the lowest value R 2 (k) takes 
for k < 10 /iMpc -1 , as measured from the R 2 (k) curves in Fig. 
HI The solid curve shows fj, as calculated from the VOBOZ mass 
distributions. The (online, blue) dashdotted curve shows // , for 
which we reduced the dispersion by assigning all of the mass in 
clusters equally among t heir subhaloes. The dash ed curve shows 
fi as calculated from the lSheth fc Tormenl i 19991) mass function, 
using no upper mass threshold. The dotted line shows a crude fit 
to fi, given in eq. 1551 . The height of the squares roughly indi- 
cates the scatter, which was similar in all curves. For very low 
and high mass cut-offs, the averages contained fewer than four 
samples because the cut-offs could not be used in all simulations. 



5.2.2 The bias approximation 

Figure^shows the galaxy-matter bias bL sn = P gg /P m m (ex- 
cluding the shot noise from P gg ) as measured from a set of 
haloes taken from a 32 ft -1 Mpc simulation (NHG), with a 
physical mass cut-off of 3 x 10 10 ft -1 M (128 particles). On 
small scales, the measured bias increases relative to what the 
galaxy-halo model would predict if the haloes were infinites- 
imal nuggets of mass [i.e. that y m m(k) — 1 in eq. 1291 1. 
This occurs because haloes are extended objects, i.e. be- 
cause j/ mm 2 , the mean-square (weighting by the halo mass 
squared) halo profile, decreases from unity at small scales. 

For this figure, we used the VOBOZ estimate of the di- 
mensionless mean-square halo mass /i = 45 (see the bottom 
middle panel of Fig.^J. The approximate concordance of the 
two quantities in the top panel reflects the fact that y mm is, 
as expected, near unity on large scales. To get the normal- 
ization of P m m right, it was necessary to divide by 'orphan 
factors' (see section 14.31 (ph/p) 2 , where p is the total mat- 
ter density, and ph is the density of matter in galaxy haloes. 
We fitted bo — 0.27 by requiring that the result of eq. 12911 . 
after dividing by {ph/p) 2 , equal the measured bias curve in 
the largest-scale bin. Including orphan factors, the effective 
large-scale bias becomes 0.75, rather small because the mass 
cut-off is low. 

The bottom panel shows y mm 2 , the quotient between 
the measured bias and the infinitesimal-nugget prediction in 
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Figure 6. In the top panel, squares show the squared bias 
b?_ (k) (excluding the shot noise from P gg ), and the curve shows 
the result of the galaxy-halo model, eq. 1291 . assuming that haloes 
are 'infinitesimal nuggets', i.e. that ymm 

(k) = 1. The value of the 
large-scale bias bo has been adjusted so that the squares and the 
curve in the top panel agree at the largest scale plotted. In the 
bottom panel, the diamonds show the ratio of the curve in the top 
panel to the squares; this ratio measures the mean-square aver- 
age Fourier-transformed halo density profile y m m(k) 2 , assuming 
that ymm = Vrnmi as i n ec l- 1^91 . For comparison, the grey (on- 
line, red) curves show ymm(k) 2 (solid) and Vmmi^) 2 (dashed) 
from the same halo sample. (The square roots of these two latter 
curves appear in Fig. HI) 

eq. 1291 . along with measurements of (ymm) 2 an d (ymm) 2 - 
These would all lie on top of each other if the assump- 
tion that ymm = Vmm used for eq. 1291 were true. For this 
set of haloes, y m m does not particularly trace the empirical 
ymm [ e 1- 1251 1. but, conveniently, it does seem to follow the 
better-defined ymm [ ec l- 1221 1. This is not surprising, since 
the regime where y mm is interesting (i.e. not unity) is on 
small scales, where P m h m dominates P m h m- At least in this 
case, the galaxy-halo model explanation of the bias rising 
on small scales because of an effective halo profile works 
plausibly well. 



6 INTERPRETING OBSERVATIONS 

Increasingly sophisticated observations of weak gravita- 
tional lensing have recently led to hig h-signal-to-noise 
measurements of galaxy-m atter clustering (Hockstra et alJ 
l2003t ISheldon etall 12004) . Measurements of the galaxy- 
matter correlation function £ 9m have previously been in- 
terpreted in at least two fashions: by direct comparison 

with haloes in dark-matter simulations such as the ones 
■ 1 1 i 

we have used (Tasitsiomi ct al. 2004), and in the c ontext 
of the halo model jGuzik fc Seliakll2002l:ISeliak et al.ll2005al: 
iMandelbaum et al.ll2005l) . 

In this section, we illustrate how the galaxy-halo model 
can be used to extract information fr om measurements o f 
£ gm and £ on ; specifically, we use £ gm dSheldon et al.ll20(3) 
and £ gg iZehavi et al] I2OO2T) as measured from a volume- 
limited sample of luminous Sloan Digital Sky Survey (SDSS) 
galaxies. Sheldon et al. and Zehavi et al. have also made 
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more precise measurements from a larger, flux-limited sam- 
ple, but they are harder to interpret, since the luminosity 
cut-off and galaxy number density change with redshift. On 
small scales where measurements exist for £ gm but not for 
£ gg , we extrapolated £ 99 with a power law based on the two 
smallest-scale points. We also tried assuming £ ss = on 
small scales, which changed the results negligibly. 

There are at least two ways to extract useful in- 
formation from these observations with the galaxy-halo 
model. First, observations give the cross-bias b(k)/R(k); 
with a model of the galaxy-matter cross-correlation coef- 
ficient R(k), it is possible to measure the bias b(k). Second, 
it is possible to extract an effective £sm( r )i or average over- 
density profile 5(r), of haloes around galaxies, as in Figure 

E 



6.1 Measurement of bias 

To measure the bias from the cross-bias P gg (k) / P grn (k) = 
b(k)/R(k), it is necessary to estimate the cross-correlation 
coefficient R(k). With a measurement of the bias, it is then 
possible to obtain the matter power spectrum. To get R(k), 
it makes sense to use the simplest expression for R(k) the 
galaxy-halo model has to offer, eq. 13511 . The most brazen 
assumption used for this equation is that y a m(k) = y m m(k). 
As shown in Fig. [3] if halo profiles vary systematically with 
mass (which they almost certainly do in the real Universe), 
this assumption is valid only on scales larger than that of 
the largest halo. In simulations (Fig. , the approximation 
is good for k < 3 h Mpc" 1 , which makes sense since clusters 
have real-space sizes < 1 h^ 1 Mpc ~ t/(3 ftMpc" 1 ). If an 
accurate model for y gm /y m m is added, it would allow an 
accurate estimate of R 2 on smaller scales, using eq. 1)340 . 

Eq. I|350 allows estimation of R 2 (k) with four ingre- 
dients: the galaxy power spectrum P, the galaxy number 
density n, a large-scale bias bo, and the dimensionless mean- 
square halo mass fi. The first two of these are known from 
the £ gg measurement, but /i and bo are not. 

Figure suggests that the best way to estimate /j, is 
to measure it from haloes in a simulation. We detected 
haloes with VOBOZ in the same 256 h~ x Mpc simulation 
as used for Fig. 0] at redshift 0.1; the redshift of the ob- 
served sample varies between 0.1 < z < 0.174. In a list 
of the haloes exceeding a 2a probability threshold, the 
most massive 10028 haloes gave the same number density 
(6 x 10 -4 h 3 Mpc -3 ) as the observed sample. The haloes 
ranged in particle number from 111 to 8364 (physically, from 
9 x 10 12 to 7 x 10 14 h' 1 Mq), giving fj, = 2.2. 

The correlation functions t; gg and £ gm from this simply 
defined set of haloes agr ee quite well with the ir observed 
counterparts. Previously. iTasitsiomi et, alJ J^004) compared 
the observed correlation functions to those of sets of haloes 
in their own simulations. Using a simple halo mass cut-off 
such as ours, their simulated correlation functions were sig- 
nificantly higher than the observations. However, by using 
a reasonable scatter in the mass-luminosity relation of dark 
matter haloes, they were able to lower the theoretical cor- 
relation functions to match the observations more closely. 
Such a 'fuzzy' halo mass cut-off lowered the correlation func- 
tions by allowing smaller-mass (and more weakly clustered) 
haloes into the sample. We do not fully understand why our 
correlation functions were lower (and thus able to reproduce 



the observations using a simple mass cut-off), but we sus- 
pect it might be explained largely from the small value of 
<T8 = 0.63 used in our simulations. 

It would be useful to estimate fi without taking the time 
to run and analyse a simulation. One alt ernative might be 
to es timate /i from a mass function (e.g. ISheth fc Tormenl 
1999). However, there is no reason to expect this to work 
perfectly, since the haloes in such a mass function are 'largest 
virialized' haloes as in the standard halo model. In Fig.Qj] we 
show how well one attempt at estimating /i from this mass 
function works; it gives /i to within a factor of 2 or so. It 
may be possible to improve this guess by fixing an upper halo 
mass cut-off in addition to the lower mass cut-off we used. 
Another way to improve the /i estimate might come from, for 
example, combining a halo mass function, a subhalo mass 
function, and a halo occupation distribution, but with all 
of these ingredients, the galaxy-halo model would approach 
the complexity of the standard halo model. 

What about the large-scale bias bo? If P gm and P gg are 
measured well into the linear regime, where one is confident 
that y gm — 1, then bo may be measured directly from P gm = 
P/bo + n . However, at present P gm is not measured to 
such large scales, so to analyze the present observations, it 
is necessary to make an educated guess for bo. 

Given the dimensionless mean-square halo mass /it, 
putting bo = 1/m in eq. 135H gives the largest-possible R 2 (k) 
at each k, giving 



R b =l/» 



1/H + nP 
1 + nP 



(40) 



The smallest-possible R 2 (k) occurs when bo — * oo on large 
scales where nP > 1/fi, and when bo — > on small scales 
where nP < 1/fJ.. These minimum values of R 2 are 

(41) 
(42) 



Rbn 



= [l + l/(nP)]" 
Rl ^o = [tx(nP + l)}- 



Since in the present C£iS6, Pgm IS unavailable on confi- 
dently linear scales, the best way to get bo seems to be, again, 
to find it from a simulation. The bo we used comes from fit- 
ting eq. I|35|l to the actual R (k) in the lowest-wavenumber 
bin, giving bo = 1.12. With no other information, a reason- 
able zeroth-order guess would be bo = 1. 

Figure Q shows how R 2 (k), as calculated with eq. 13511 . 
varies with bo. It is unlikely that bo would wander by 
more than a factor of two or so from unity. In the lowest- 
wavenumber bin, R varies only by a factor of ~ 1.2 as bo 
varies between 1/2 and 2. The measured R 2 from the simu- 
lation appears as the dashed curve. 

Figure |H| shows the bias, both including and excluding 
the shot noise from P gg , as inferred by dividing R/b by R 
from eq. 1351 . We plot 1/b instead of b (as Sheldon et al. do) 
because the error bars are larger in P gm than in P gg . The 
squares are the raw observations, R/b, and the dashed line 
shows the result after dividing this by R. In calculating R, 
H — 2.2 and bo = 1.12 are fixed, but the thick error bars 
floating in the upper-left corners show the largest fluctua- 
tions (which occur at the largest scales) in l/b_ sn and 1/b 
if bo is multiplied and divided by 2. 

The thin, larger error bars are the observational error 
bars in 1/b, propagated through from £ gm and £ gg . The er- 
ror bars on P gm , denoted 8P gm , are obtained by putting 
the covariance matrix of £ flm (which Erin Sheldon kindly 
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Figure 7. An illustration of how the galaxy-matter cross- 
correlation coefficient R 2 (k), calculated using eq. 1351 . depends on 
the large-scale bias parameter 60 . The galaxy spectrum P is taken 
from from a volu me-limited sample of luminous SDSS galaxies 
iZehavi et al.ll2002l) . The dimensionless mean-square halo mass 
fx is fixed at 2.2. Solid curves are labelled according to the val- 
ues of fto used to calculate them. For realistic halo samples, bo 
should seldom stray outside the range 1/2 < bo < 2; in extreme 
cases, it might venture out to 1/4 or 4. The curve at bo = 1/2 is 
nearly identical to the highest-possible R 2 (k) for each k, which 
uses bo = 1/fJ, ~ 0.45. The dashed curve shows R 2 measured 
from a simulation described in the text. The dashed curve shows 
R 2 from eq. 1351 . using bo = 1.12, the value of bo which gives a 
curve matching the dashed curve in the lowest-wavenumber bin, 
at k m 0.5 /iMpc" 1 . 



provided to us) through a two-dimensional FFTLog. Unfor- 
tunately, rigorous error bars have not been measured for P gg 
for this sample. As suggested by Idit Zehavi (private com- 
munication), we crudely estimated the error bars on £ 99 by 
assuming that in each bin, the fractional error (5£ 99 )/£ 99 is 
the same as that in the angular galaxy correlation function 
(8wp)/w p , whose error bars have been measured. To esti- 
mate 5 Pgg, we put both £ 99 ± 5£ 99 through FFTLog; we set 
s Zgg = [F(€gg + s Zgg) " F ltgg - <^ 99 )]/2, where F denotes 
a Fourier transform. The crudeness of SP gg is not terribly 
worrisome, though, since (5P gm )/Pgm > {SP gg )/P 3 g. 

The solid curves in Fig. [8] show the bias as measured 
from the simulation. There is good agreement between the 
dashed and solid curves on large scales, k < 3 /iMpc -1 , 
where eq. 13511 works reasonably well. It should be kept in 
mind that comparing the dashed to the solid curves tests 
not the galaxy-halo model, but how well the set of haloes 
chosen from the simulation represents the observed galaxies. 



6.2 Measurement of average halo density profiles 

Now we describe a measurement of an average halo density 
profile from these observations of £ 99 and £ 9m ; for a descrip- 
tion of the procedure, see section I5T1 

Figure|5|shows a splitting of the observed galaxy-matter 
correlation function £ 9 



e g t = 8(r), 



, 9m into a one-halo term, s ,,„ 
and a two-halo term, calculated using the best-fitting 
bo — 1.12 from the previous section. The one-halo £ 9 h 



»gm dif- 
fers from the full £ 9m slightly on large scales, r > 1 ft -1 Mpc. 
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Figure 8. An attempt to measure the bias b(k) (with the 
shot noise included in P 99 ) and 6_ sn (fc) (without the shot noise 
in P 99 ) of a volume-limited sample of luminous SDSS galax- 
ies. Because the error bars are larger in P gm , we put it in 
the numerator, showing 1/6 instead of b. The squares show the 
cross-bias R(k)/b(k) = P am (k) /P 00 (k), measured from £ 9m (r) 
iSheldon et al.l2004l) and £ 99 (r) iZehavi et al.l2002l) . The dashed 
curves show 1/6 as calculated by dividing R/b by R as calculated 
from eq. 1351 . and as shown in Fig. For the dashed curves, 
we used a dimensionless mean-square halo mass ji = 2.2 and a 
large-scale bias bo = 1.12. The isolated, thick, grey (online, green) 
error bars in the upper-left corners of each panel show the largest 
fluctuation (in the smallest-wavenumber bin) in 1/6 and l/6_ an 
and if bo varies by factors of two in both directions from 1.12. 
The thin, black error bars are the observational error bars, prop- 
agated through the analysis. The (online, blue) solid curves are 
1/6 measured from a halo catalog in a simulation; the halo cat- 
alog has a lower mass cut-off giving the same number density as 
the observed galaxy catalog. 



Evidently, there is not much overlap between haloes on the 
scales measured, which is not surprising since the high halo 
mass threshold precludes a large subhalo fraction in the sam- 
ple. However, there is only a significant signal in £ gm on 
scales r < 1 h^ 1 Mpc, limiting the signal in £ 9 „ and £ 9 J5j as 
well. 

What does an 'average halo profile' really mean? In sec- 
tion all of the halo profiles were identical (up to a mul- 
tiplicative constant). In the real Universe, though, there are 
haloes of different sizes, shapes, and environments. The pro- 
cedure in the present section gives the average halo profile 
under a partition of dark matter (including orphans) into 
haloes such that j/ 9 ^ = y 9 m- I n such a partition, halo pro- 
files do not depend systematically on the clustering of their 
galaxies. Although this equality holds fairly well in simu- 
lations, it cannot hold exactly, since in the real Universe, 
both halo profiles and the amplitude of the galaxy power 
spectrum depend on halo mass. However, the question re- 
mains: is it possible to partition the dark matter in the real 
Universe into 'haloes' around galaxies in a physically mean- 
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Figure 9. An attempt to recover the average overdensity profile 
of haloes around galaxies in a volume-limited sample of luminous 
SDSS galaxies from the galaxy-matter correlation function, using 
the galaxy correlatio n function. The squar es with error bars are 
the measurement by ISheldon et all {2004) of § flm (plus 1), and 
the dark grey (online, blue) solid curve is the effective one-halo 
term £gjj,, or 5(r) (plus 1). This curve becomes dotted, and goes 
to the bottom of the plot, where fgj^ + 1 < 0. The thick error 
bars around data points in the one-halo curve show the results 
of varying the large-scale bias bo by factors of 2 of the canonical 
value, bo = 1.12, in both directions. The light grey (online, green) 
curve is the two- halo term, ^{jj + 1, again with thick error bars 
showing the fluctuation it experiences as bo changes by factors of 
2. 

ingful, if somewhat artificial, way to ensure that y^ = 
If there is, then our procedure to isolate the \h and 2/i terms 
of £ gm gives the precise average halo profile if the dark mat- 
ter is partitioned in this way. 



7 CONCLUSION 

This paper introduces a new, galaxy-halo model of large- 
scale structure. It is related conceptually to the standard 
halo model of large-scale structure, but there are significant 
differences. In the standard halo model, haloes are the fun- 
damental objects; galaxies are placed within them accord- 
ing to the halo mass. In the galaxy-halo model, galaxies are 
the fundamental objects, which have (galaxy) haloes around 
them. 

One result to come out of the galaxy-halo model is a 
deeper understanding of the galaxy-matter cross-correlation 
coefficient R in terms of a halo mass dispersion. Equation 
13511 , using a few inputs (the galaxy power spectrum, a large- 
scale bias, and a dimensionless measure of the scatter in the 
halo mass), gives an approximation for R(k) which seems 
accurate on mildly non-linear scales, k < 3 hMpc -1 . With 
this model for R, it becomes feasible to measure the galaxy- 
matter bias down to scales k < 3 /iMpc -1 from measure- 
ments of the galaxy and galaxy-matter power spectra (or 
correlation functions) , and thereby to infer the matter power 
spectrum down to these scales. 

This equation for R(k) has the following intuitive expla- 
nation. On small scales, the measurement of R(k) samples 



at most one galaxy at a time. A scatter in halo mass thus 
naturally produces a scatter in the galaxy density-matter 
density relationship, producing a small value of R(k) (this 
value depends on the spread in halo masses) . On large scales, 
many haloes are averaged over to measure R(k), reducing 
the scatter in the galaxy density-matter density relationship 
and forcing R(k) toward unity. 

The galaxy-halo model also provides a technique for 
inferring average halo density profiles, given measurements 
from a galaxy sample of the galaxy and galaxy-matter corre- 
lation functions. It is really the one-halo term of the galaxy- 
matter correlation function which corresponds to a average 
halo density profile; we present and test an algorithm to 
isolate this term. 

Another application of the galaxy-halo model is to the 
bias b 2 (k) = P gg (k) / P mm (k) between galaxies and matter. 
If the shot noise is excluded from the galaxy power spec- 
trum, the bias generally dips down on intermediate scales 
where the one-halo and two-halo terms of the matter power 
spectrum are comparable. On small scales, the bias gener- 
ally increases with wavenumber because of haloes' extended 
(not pointlike) density profiles, which cause a downturn in 
the matter power spectrum. 
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